NextClip: an analysis and read preparation tool for Nextera Long Mate Pair libraries

Authors

  • Richard M. Leggett
  • Bernardo J. Clavijo
  • Leah Clissold
  • Matthew D. Clark
  • Mario Caccamo

Abstract

Summary: Illumina's recently released Nextera Long Mate Pair (LMP) kit enables production of jumping libraries of up to 12 kb. The LMP libraries are an invaluable resource for carrying out complex assemblies and other downstream bioinformatics analyses such as the characterization of structural variants. However, LMP libraries are intrinsically noisy and to maximize their value, post-sequencing data analysis is required. Standardizing laboratory protocols and the selection of sequenced reads for downstream analysis are non-trivial tasks. NextClip is a tool for analyzing reads from LMP libraries, generating a comprehensive quality report and extracting good quality trimmed and deduplicated reads.

Availability and implementation: Source code, user guide and example data are available from https://github.com/richardmleggett/nextclip/.
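
To make "trimmed and deduplicated reads" concrete, the following is a minimal Python sketch of the general idea only. It is not NextClip's implementation: the function names, the exact-match adapter search, the `min_len` threshold and the toy adapter in the usage note are all assumptions made for illustration. A real tool would also need to tolerate sequencing errors and partial adapters at read ends.

```python
from typing import Iterable, Iterator, Optional, Tuple


def clip_at_adapter(read: str, adapter: str, min_len: int = 25) -> Optional[str]:
    """Keep the bases before the first exact adapter match.

    Returns None when the remaining sequence is shorter than min_len.
    Exact matching is a simplification (hypothetical, for illustration).
    """
    pos = read.find(adapter)
    kept = read if pos == -1 else read[:pos]
    return kept if len(kept) >= min_len else None


def clip_and_deduplicate(
    pairs: Iterable[Tuple[str, str]], adapter: str, min_len: int = 25
) -> Iterator[Tuple[str, str]]:
    """Clip both reads of each pair, drop short pairs, and drop exact duplicates."""
    seen = set()
    for read1, read2 in pairs:
        clipped1 = clip_at_adapter(read1, adapter, min_len)
        clipped2 = clip_at_adapter(read2, adapter, min_len)
        if clipped1 is None or clipped2 is None:
            continue  # at least one read is too short after clipping
        key = (clipped1, clipped2)
        if key in seen:
            continue  # identical clipped pair already emitted
        seen.add(key)
        yield clipped1, clipped2
```

Called with a toy adapter such as `"GGGTTT"` and a small `min_len`, the generator yields each distinct clipped pair once and silently skips pairs in which either read falls below the length threshold.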

Related resources

Assembling short reads from jumping libraries with large insert sizes

Motivation: Advances in Next-Generation Sequencing technologies and sample preparation recently enabled generation of high-quality jumping libraries that have a potential to significantly improve short read assemblies. However, assembly algorithms have to catch up with experimental innovations to benefit from them and to produce high-quality assemblies. Results: We present a new algorithm that ...

Optimization and cost-saving in tagmentation-based mate-pair library preparation and sequencing.

In de novo genome sequencing, mate-pair reads are crucial for scaffolding assembled contigs. However, preparation of mate-pair libraries is not a trivial task, even when using one of the latest approaches, the Nextera Mate Pair Sample Prep Kit from Illumina. To reduce cost and enhance library yield and fidelity when using this kit, we have modified the manufacturer's protocol based on (i) varia...

NxRepair: error correction in de novo sequence assembly using Nextera mate pairs

Scaffolding errors and incorrect repeat disambiguation during de novo assembly can result in large scale misassemblies in draft genomes. Nextera mate pair sequencing data provide additional information to resolve assembly ambiguities during scaffolding. Here, we introduce NxRepair, an open source toolkit for error correction in de novo assemblies that uses Nextera mate pair libraries to identif...

NxTrim: optimized trimming of Illumina mate pair reads

Motivation: Mate pair protocols add to the utility of paired-end sequencing by boosting the genomic distance spanned by each pair of reads, potentially allowing larger repeats to be bridged and resolved. The Illumina Nextera Mate Pair (NMP) protocol uses a circularization-based strategy that leaves behind 38-bp adapter sequences, which must be computationally removed from the data. While ‘adapt...

Volume: 30

Publication date: 2014